PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID XP_009149597.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Brassiceae; Brassica
Family Trihelix
Protein Properties Length: 401aa    MW: 46027.2 Da    PI: 6.2351
Description Trihelix family protein
Gene Model
Gene Model ID     Type      Source    Coding Sequence
XP_009149597.1    genome    NCBI      View CDS
Signature Domain
No.    Domain      Score    E-value    Start    End    HMM Start    HMM End
1      trihelix    85.2     7.8e-27    89       183    1            85
        trihelix   1 rWtkqevlaLiearremeerlrrgk.........lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkr.tsessstcpyfdql 85 
                     +Wt+++v++Li+a+++++++++ +          +kk++W++vsk+m+erg+++sp+qC++k+++lnkrykk++++ +++ +++++++ +++d++
  XP_009149597.1  89 KWTDKMVKLLITAVSYIGDDSSMDGgsrrkfaalQKKGKWKSVSKVMSERGYHVSPQQCEDKFNDLNKRYKKLNDMLGRGtSCQVVENPALLDSI 183
                     7**************8888887543345556677**********************************************669999998888765 PP

Protein Features
3D Structure
Database    Entry ID    E-value    Start    End    InterPro ID    Description
Pfam        PF13837     1.5E-24    87       210    No hit         No description
Sequence
Protein Sequence    Length: 401 aa
MDGNFPQGGV VRGGPSSFGG FDLQTSMNQH RHSLHEGLPF TMATAQTCDH HHNVPMTEQH  60
KGDSDDDEPS FNEEGGEKST KGSPWQRVKW TDKMVKLLIT AVSYIGDDSS MDGGSRRKFA  120
ALQKKGKWKS VSKVMSERGY HVSPQQCEDK FNDLNKRYKK LNDMLGRGTS CQVVENPALL  180
DSIGYLNDKE KDDVRKIMSS KHLFYEEMCS YHNGNRLHLP HDLALQRSLK LALRNRDDHE  240
SHHQVEDVED DDGEGEEHDE YDDFTHGGCR GVHLGGGGAL KKARQSHSHE DVDHPGQQVS  300
SLECNKLSLA QMPFPQGGEE SERSALMQKQ WIESRTLQLE EQKLQIQVEL LELEKQRFRW  360
ERFSKKRDQE LERMRMENER MKLENDRMGL ELKQRELGVE L
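The sequence block above is laid out in groups of ten residues with a running position counter at the end of each full line, so it cannot be used as-is by most tools. The following is a minimal Python sketch, not part of the database record, that strips the counters and spaces and then slices out the trihelix hit using the Start and End values from the Signature Domain table (89 and 183, 1-based and inclusive). The names RAW_BLOCK and clean_sequence are hypothetical and chosen only for illustration.

```python
import re

# Sequence block copied verbatim from the record, including the
# trailing position counters (60, 120, ...).
RAW_BLOCK = """\
MDGNFPQGGV VRGGPSSFGG FDLQTSMNQH RHSLHEGLPF TMATAQTCDH HHNVPMTEQH  60
KGDSDDDEPS FNEEGGEKST KGSPWQRVKW TDKMVKLLIT AVSYIGDDSS MDGGSRRKFA  120
ALQKKGKWKS VSKVMSERGY HVSPQQCEDK FNDLNKRYKK LNDMLGRGTS CQVVENPALL  180
DSIGYLNDKE KDDVRKIMSS KHLFYEEMCS YHNGNRLHLP HDLALQRSLK LALRNRDDHE  240
SHHQVEDVED DDGEGEEHDE YDDFTHGGCR GVHLGGGGAL KKARQSHSHE DVDHPGQQVS  300
SLECNKLSLA QMPFPQGGEE SERSALMQKQ WIESRTLQLE EQKLQIQVEL LELEKQRFRW  360
ERFSKKRDQE LERMRMENER MKLENDRMGL ELKQRELGVE L
"""


def clean_sequence(block: str) -> str:
    """Keep only residue letters; drops spaces, newlines and position counters."""
    return "".join(re.findall(r"[A-Za-z]", block)).upper()


seq = clean_sequence(RAW_BLOCK)

# Domain coordinates from the Signature Domain table are 1-based and
# inclusive, so the Python slice starts one position earlier.
DOMAIN_START, DOMAIN_END = 89, 183
domain = seq[DOMAIN_START - 1:DOMAIN_END]

print(f"protein length: {len(seq)} aa")
print(f"domain length:  {len(domain)} aa")
print(f"domain starts:  {domain[:10]}")
```

The cleaned length can be compared with the stated protein length, and the first residues of the slice with the query row of the alignment, as a quick check that the coordinates were read correctly.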
Regulation -- PlantRegMap
Source         Upstream Regulator    Target Gene
PlantRegMap    Retrieve              -
Annotation -- Nucleotide
Source     Hit ID      E-value    Description
GenBank    AY045850    1e-146     AY045850.1 Arabidopsis thaliana unknown protein (At1g21200) mRNA, complete cds.
GenBank    AY091378    1e-146     AY091378.1 Arabidopsis thaliana unknown protein (At1g21200) mRNA, complete cds.
GenBank    CP002684    1e-146     CP002684.1 Arabidopsis thaliana chromosome 1 sequence.
GenBank    F16F4       1e-146     AC036104.3 Sequence of BAC F16F4 from Arabidopsis thaliana chromosome 1, complete sequence.
Annotation -- Protein
Source    Hit ID            E-value    Description
Refseq    XP_009149597.1    0.0        PREDICTED: uncharacterized protein LOC103872912
TrEMBL    M4EAS2            0.0        M4EAS2_BRARP; Uncharacterized protein
STRING    Bra025881.1-P     0.0        (Brassica rapa)
Orthologous Group
Lineage    Orthologous Group ID    Taxa Number    Gene Number
Malvids    OGEM5042                27             52
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT1G21200.1    1e-141     sequence-specific DNA binding transcription factors